STREPTOCOCCUS PNEUMONIAE PROTEINS AND IMMUNOGENIC FRAGMENTS FOR VACCINES



This application is based on U.S. Provisional Application No. 
60/113,048, filed 21 December 1998, which is hereby incorporated in its 
entirety. 

FIELD OF THE INVENTION 

This invention relates generally to the field of bacterial antigens and their 
use, for example, as immunogenic agents in humans and animals to stimulate 
an immune response. More specifically, it relates to the vaccination of 
mammalian species with a polypeptide comprising at least one conserved 
histidine triad residue (HxxHxH) and at least one helix-forming polypeptide 
obtained from Streptococcus pneumoniae as a mechanism for stimulating 
production of antibodies that protect the vaccine recipient against infection by 
a wide range of serotypes of pathogenic S. pneumoniae. Further, the
invention relates to antibodies against such polypeptides useful in diagnosis 
and passive immune therapy with respect to diagnosing and treating such 
pneumococcal infections. 

In a particular aspect, the present invention relates to the prevention and 
treatment of pneumococcal infections such as infections of the middle ear, 
nasopharynx, lung and bronchial areas, blood, CSF, and the like, that are 
caused by pneumococcal bacteria. 

BACKGROUND OF THE INVENTION

Streptococcus pneumoniae is a gram-positive bacterium that is a major
causative agent of invasive infections in animals and humans, such as sepsis,
meningitis, otitis media and lobar pneumonia (Tuomanen et al., New Engl. J.
Med. 322:1280-1284 (1995)). As part of the infective process, pneumococci
readily bind to non-inflamed human epithelial cells of the upper and lower
respiratory tract by binding to eukaryotic carbohydrates in a lectin-like manner
(Cundell et al., Micro. Path. 17:361-374 (1994)). Conversion to invasive
pneumococcal infections for bound bacteria may involve the local generation of
inflammatory factors which may activate the epithelial cells to change the
number and type of receptors on their surface (Cundell et al., Nature
377:435-438 (1995)). Apparently, one such receptor, platelet activating factor
(PAF), is engaged by the pneumococcal bacteria and, within a very short period
of time (minutes) from the appearance of PAF, pneumococci exhibit strongly
enhanced adherence and invasion of tissue. Certain soluble receptor analogs
have been shown to prevent the progression of pneumococcal infections
(Idanpaan-Heikkila et al., J. Inf. Dis. 176:704-712 (1997)). A number of
other proteins have been suggested as being involved in the
pathogenicity of S. pneumoniae. There remains a need for identifying
polypeptides having epitopes in common from various strains of S.
pneumoniae in order to utilize such polypeptides as vaccines to provide
protection against a wide variety of S. pneumoniae.

SUMMARY OF INVENTION

In accordance with the present invention, there are provided vaccines and
vaccine compositions that include polypeptides obtained from S. pneumoniae
and/or variants of said polypeptides and/or active fragments of such
polypeptides.

The active fragments, as hereinafter defined, include a histidine triad
residue(s) and/or coiled coil regions of such polypeptides.

The term "percent identity" or "percent identical," when referring to a
sequence, means that a sequence is compared to a claimed or described
sequence from an alignment of the sequence to be compared (the "Compared
Sequence") with the described or claimed sequence (the "Reference
Sequence"). The percent identity is determined as follows:

Percent Identity = [1 - (C/R)] x 100

wherein C is the number of differences between the Reference Sequence and
the Compared Sequence over the length of the alignment between the
Compared Sequence and the Reference Sequence, wherein (i) each base or
amino acid in the Reference Sequence that does not have an aligned base or
amino acid in the Compared Sequence, (ii) each gap in the Reference
Sequence, and (iii) each aligned base or amino acid in the Reference Sequence
that is different from an aligned base or amino acid in the Compared Sequence
constitutes a difference; and R is the number of bases or amino acids in the
Reference Sequence over the length of the alignment with the Compared
Sequence, with any gap created in the Reference Sequence also being counted
as a base or amino acid.

If an alignment exists between the Compared Sequence and the
Reference Sequence in which the Percent Identity as calculated above is about
equal to or greater than a specified minimum Percent Identity, then the
Compared Sequence has the specified minimum Percent Identity to the
Reference Sequence even though alignments may exist in which the
hereinabove calculated Percent Identity is less than the specified Percent
Identity.
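
By way of illustration only, the Percent Identity calculation defined above may be sketched in Python for two sequences that have already been aligned. The function names, the use of "-" as the gap character and the example sequences are assumptions made for this sketch; no particular alignment program is implied, and the qualifier "about" in the definition is not modelled.

```python
def percent_identity(reference_aligned: str, compared_aligned: str, gap: str = "-") -> float:
    """Return [1 - (C/R)] x 100 for two pre-aligned sequences of equal length."""
    if len(reference_aligned) != len(compared_aligned):
        raise ValueError("aligned sequences must have the same length")
    differences = 0        # C: mismatches plus positions gapped in either sequence
    reference_length = 0   # R: Reference Sequence positions, gaps in it included
    for ref, comp in zip(reference_aligned.upper(), compared_aligned.upper()):
        if ref == gap and comp == gap:
            continue  # a column with no residue in either sequence carries no information
        reference_length += 1
        if ref != comp:
            differences += 1
    if reference_length == 0:
        raise ValueError("alignment is empty")
    return (1 - differences / reference_length) * 100


def has_minimum_identity(alignments, minimum_percent: float) -> bool:
    """True if any one alignment reaches the specified minimum Percent Identity."""
    return any(percent_identity(ref, comp) >= minimum_percent for ref, comp in alignments)
```

For example, `percent_identity("AAAA", "AAAT")` counts one difference over four positions. `has_minimum_identity` reflects the rule that a single qualifying alignment suffices even if other alignments score lower.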

"Isolated" in the context of the present invention with respect to 
polypeptides and/or polynucleotides means that the material is removed from 
its original environment (e.g., the natural environment if it is naturally 
occurring). For example, a naturally-occurring polynucleotide or polypeptide 
present in a living organism is not isolated, but the same polynucleotide or 
polypeptide, separated from some or all of the co-existing materials in the 
natural system, is isolated. Such polynucleotides could be part of a vector 
and/or such polynucleotides or polypeptides could be part of a composition, 
and still be isolated in that such vector or composition is not part of its natural 
environment. The polypeptides and polynucleotides of the present invention 
are preferably provided in an isolated form, and preferably are purified to 
homogeneity. 

BRIEF DESCRIPTION OF DRAWINGS

Figures 1A-1C, respectively, report the results of three experiments
using different preparations of Sp36. The results demonstrate that active
immunization with recombinant Sp36 derived from pneumococcal strain
Norway serotype 4 is able to protect mice from death in a model of
pneumococcal sepsis using a heterologous strain, SJ2 (serotype 6B). In each
of the three experiments shown, one hundred percent of the mice immunized
with Sp36 survived for the 14-day observation period following challenge
with approximately 500 CFU of pneumococci, while eighty to one hundred
percent of sham-immunized mice (injected with PBS and adjuvant) died
during the same period.

Figures 2A-2B show that passive administration of rabbit antiserum
raised against Sp36 derived from Norway type 4 was able to protect mice in
the pneumococcal sepsis model using two heterologous strains. Figure 2A
shows that one hundred percent of the mice immunized with the Sp36
antiserum survived the 21-day observation period after challenge with 172
CFU of strain SJ2 (serotype 6B). Eighty percent of the mice immunized
with a control serum (rabbit anti-FimC) died by day 8, and ninety percent
died by day 12. Figure 2B shows that ninety percent of the mice immunized
with the Sp36 antiserum survived the 8-day observation period after challenge
with 862 CFU of strain EF6796 (serotype 6A). Ninety percent of the mice
immunized with a control serum (collected before immunization) died by day
5.

Figure 3 is a western blot demonstrating the ability of antisera raised
against recombinant Sp36 derived from strain Norway type 4 to react with
Sp36 of heterologous strains. Total cell lysates were immunoblotted with
mouse antisera to Sp36. A band representing Sp36 protein was detected in
all 23 S. pneumoniae strains tested, which included isolates from each of the
23 pneumococcal serotypes represented in the current polysaccharide
vaccine.

Figure 4 is a Southern blot showing that the Sp36 gene from Norway 
type 4 hybridizes with genomic DNA from 24 other pneumococcal strains, 
indicating the presence of similar sequences in all these strains. 

Figure 5 is a western blot showing the reactivity of patient sera with
Sp36. Sp36 (either full-length, panel A; N-terminal half, panel B; or
C-terminal half, panel C) was electrophoresed by SDS-PAGE and transferred to
nitrocellulose. Patient sera collected soon after the onset of illness (acute
serum, lanes A) or eight to 30 days later (convalescent serum, lanes C) were
used to probe the blots. For patients 2, 3, and 5, convalescent serum
reacted more strongly with Sp36 than did the corresponding acute serum.

Figure 6 is an amino acid alignment comparison of four related
pneumococcal proteins, namely Sp36A (PhtA; SEQ ID NO:8), Sp36B (PhtB;
SEQ ID NO:10), Sp36D (PhtD; SEQ ID NO:4), Sp36E (PhtE; SEQ ID NO:6),
respectively. Dashes in a sequence indicate gaps introduced to maximize the
sequence similarity. Amino acid residues that match are boxed.

Figure 7 is a nucleotide alignment comparison of four related
pneumococcal genes, namely Sp36A (PhtA; SEQ ID NO:9), Sp36B (PhtB;
SEQ ID NO:11), Sp36D (PhtD; SEQ ID NO:5), Sp36E (PhtE; SEQ ID NO:7),
respectively. Dashes in a sequence indicate gaps introduced to maximize the
sequence similarity.

Figure 8 shows the results of immunization of mice with PhtD
recombinant protein, which leads to protection from lethal sepsis. C3H/HeJ
(Panels A and B) or Balb/cByJ (Panel C) mice were immunized subcutaneously
with PhtD protein (15 µg in 50 µl PBS emulsified in 50 µl complete Freund's
adjuvant (CFA)). The recombinant PhtD protein used in protection
experiments consisted of 819 amino acid residues, starting with the cysteine
(residue 20). A group of 10 sham-immunized mice received PBS with
adjuvant. A second immunization of 15 µg protein with incomplete Freund's
adjuvant (IFA) was administered 3 weeks later; the sham group received PBS
with IFA. Blood was drawn (retro-orbital bleed) at week 7, and sera from
each group were pooled for analysis of anti-PhtD antibody by ELISA. Mice
were challenged at week 8 by an intraperitoneal (i.p.) injection of
approximately 550 CFU of S. pneumoniae strain SJ2, serotype 6B (Panel A),
850 CFU of strain EF6796, serotype 6A (Panel B) or 450 CFU of strain
EF5668, serotype 4 (Panel C). In preliminary experiments, the LD50 values for
strains SJ2 and EF6796 were each determined to be approximately 10 CFU.
The LD50 for strain EF5668 was determined to be < 5 CFU. Survival
was determined in all groups over the course of 15 days following challenge.
Data are presented as the percent survival for a total of 10 mice per
experimental group. A two-sample log-rank test was used for statistical
analysis comparing recombinant Pht-immunized mice to sham-immunized
mice.



SUMMARY OF THE INVENTION 

In accordance with one aspect of the present invention, there is
provided a vaccine, generally in the form of a composition, that includes at
least one polypeptide that is at least 90% identical to (i) a polypeptide
sequence comprising amino acids 1-819 of SEQ ID NO:4 or (ii) a polypeptide
sequence comprising amino acids 1-460 of SEQ ID NO:6 or an active fragment
of the foregoing.

In accordance with another aspect of the present invention, there is 
provided a vaccine, generally in the form of a composition, that includes an 
active fragment of a polypeptide that is at least 90% identical to (i) a 
polypeptide comprising amino acids 1-800 of SEQ ID NO:8 or (ii) a polypeptide 
comprising amino acids 1-800 of SEQ ID NO:10.

The term "active fragment" means a fragment that includes one or more 
histidine triad residues and/or one or more coiled coil regions. A "histidine 
triad residue" is the portion of the polypeptide that has the sequence HxxHxH 
wherein H is histidine and x is an amino acid other than histidine.
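
As an illustration of the HxxHxH definition above, the following Python sketch locates such motifs in an amino acid sequence given in one-letter code. The pattern name, the function name and the test sequence are assumptions made for this example; a lookahead is used so that overlapping occurrences, if any, would also be reported.

```python
import re

# H, two residues other than H, H, one residue other than H, H.
# Wrapping the motif in a lookahead lets overlapping occurrences be found.
HISTIDINE_TRIAD = re.compile(r"(?=(H[^H]{2}H[^H]H))")


def find_histidine_triads(sequence: str) -> list[tuple[int, int]]:
    """Return 1-based, inclusive (start, end) positions of each HxxHxH motif."""
    return [
        (match.start() + 1, match.start() + 6)
        for match in HISTIDINE_TRIAD.finditer(sequence.upper())
    ]
```

Positions are reported in the 1-based, inclusive style used in this specification (for example, "amino acids 64-69"). The sketch does not validate that the input contains only amino acid letters.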

A coiled coil region is the region predicted by the "Coils" algorithm: Lupas,
A., Van Dyke, M., and Stock, J. (1991) Predicting Coiled Coils from Protein
Sequences, Science 252:1162-1164.

In accordance with one embodiment, the active fragment includes both 
one or more histidine triad residues and at least one coiled coil region of the 
applicable polypeptide sequence. In accordance with another embodiment, the 
active fragment includes at least two histidine triad residues. 

In another embodiment, the active fragment that includes at least one 
histidine triad residue or at least one coiled-coil region of the applicable 
polypeptide includes at least about ten percent of the applicable polypeptide 
and no more than about 85% of the applicable polypeptide. 

The polypeptide of SEQ ID NO:4 includes five histidine triad residues, as
follows:

amino acids 64-69; 188-193; 296-301; 541-546; and 625-630.

The polypeptide of SEQ ID NO:6 includes five histidine triad residues, as
follows:

amino acids 63-68; 185-190; 289-294; 376-381; and 441-446.

In addition, the polypeptide of SEQ ID NO:4 includes two coiled-coil
regions (amino acids 120-140 and amino acids 750-772) and the polypeptide
of SEQ ID NO:6 includes one coiled-coil region (amino acids 119-152).

The polypeptide of SEQ ID NO:8 includes the following regions:

HxxHxH: amino acids 63-68, 189-194, 309-314, 550-555, 634-639.
Coiled-coils: amino acids 118-145, 406-434, 462-493, 724-751.
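
The region coordinates listed above lend themselves to a simple check of whether a proposed fragment qualifies as an "active fragment". The Python sketch below records the SEQ ID NO:4 coordinates given in the text and tests a fragment against them. The class and function names are hypothetical; requiring that a region lie wholly within the fragment, and the use of 819 residues as the polypeptide length in the usage note, are assumptions of this sketch; and the 10% and 85% bounds apply only to the embodiment that states them and are treated here as exact rather than "about".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    kind: str   # "histidine_triad" or "coiled_coil"
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


# Coordinates for SEQ ID NO:4 as listed in the text.
SEQ_ID_NO_4_REGIONS = (
    Region("histidine_triad", 64, 69),
    Region("histidine_triad", 188, 193),
    Region("histidine_triad", 296, 301),
    Region("histidine_triad", 541, 546),
    Region("histidine_triad", 625, 630),
    Region("coiled_coil", 120, 140),
    Region("coiled_coil", 750, 772),
)


def regions_in_fragment(regions, frag_start: int, frag_end: int) -> list:
    """Regions lying wholly inside the fragment (an assumption of this sketch)."""
    return [r for r in regions if frag_start <= r.start and r.end <= frag_end]


def is_candidate_active_fragment(regions, frag_start: int, frag_end: int,
                                 polypeptide_length: int,
                                 min_fraction: float = 0.10,
                                 max_fraction: float = 0.85) -> bool:
    """True if the fragment holds at least one region and fits the size bounds."""
    fraction = (frag_end - frag_start + 1) / polypeptide_length
    has_region = bool(regions_in_fragment(regions, frag_start, frag_end))
    return has_region and min_fraction <= fraction <= max_fraction
```

For instance, `is_candidate_active_fragment(SEQ_ID_NO_4_REGIONS, 50, 200, 819)` considers a fragment spanning residues 50-200, which contains two histidine triad residues and one coiled-coil region.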

In accordance with a further aspect of the invention, a vaccine of the
type hereinabove described is administered for the purpose of preventing or
treating infection caused by S. pneumoniae.

A vaccine, or vaccine composition, in accordance with the present
invention may include one or more of the hereinabove described polypeptides
or active fragments thereof. When employing more than one polypeptide or
active fragment, such two or more polypeptides and/or active fragments may
be used as a physical mixture or as a fusion of two or more polypeptides or
active fragments. The fusion fragment or fusion polypeptide may be produced,
for example, by recombinant techniques or by the use of appropriate linkers for
fusing previously prepared polypeptides or active fragments.

In an embodiment of the invention, there is provided (a) a polypeptide 
that is at least 95% identical or at least 97% identical or 100% identical to (i) 
a polypeptide sequence comprising amino acids 1 to 819 of SEQ ID NO:4 or 
(ii) a polypeptide sequence comprising amino acids 1-460 of SEQ ID NO:6; or 
(b) an active fragment of the polypeptide of (a). 

In the case where the polypeptide is a variant of the polypeptide 
comprising the mature polypeptide of SEQ ID NO:4 or SEQ ID NO:6, or any of 
the active fragments of the invention, the variation in the polypeptide or 
fragment is generally in a portion thereof other than the histidine triad residues 
and the coiled-coil region, although variations in one or more of these regions 
may be made. 

In many cases, the variation in the polypeptide or active fragment is a 
conservative amino acid substitution, although other substitutions are within 
the scope of the invention. 

In accordance with the present invention, a polypeptide variant includes 
variants in which one or more amino acids are substituted and/or deleted 
and/or inserted. 

In another aspect, the invention relates to passive immunity vaccines 
formulated from antibodies against a polypeptide or active fragment of a 
polypeptide of the present invention. Such passive immunity vaccines can be 
utilized to prevent and/or treat pneumococcal infections in patients. In this 
manner, according to a further aspect of the invention, a vaccine can be 
produced from a synthetic or recombinant polypeptide of the present invention
or an antibody against such polypeptide. 

In still another aspect, the present invention relates to a method of using
one or more antibodies (monoclonal, polyclonal or sera) to the polypeptides of
the invention as described above for the prophylaxis and/or treatment of
diseases that are caused by pneumococcal bacteria. In particular, the
invention relates to a method for the prophylaxis and/or treatment of infectious
diseases that are caused by S. pneumoniae. In a still further preferred aspect,
the invention relates to a method for the prophylaxis and/or treatment of otitis
media, nasopharyngeal and bronchial infections, and the like in humans by
utilizing a vaccine of the present invention.

Generally, vaccines are prepared as injectables, in the form of aqueous
solutions or suspensions. Vaccines in an oil base, such as for inhaling, are
also well known. Solid forms which are dissolved or suspended prior to use may
also be formulated. Pharmaceutical carriers are generally added that are
compatible with the active ingredients and acceptable for pharmaceutical use.
Examples of such carriers include, but are not limited to, water, saline
solutions, dextrose, or glycerol. Combinations of carriers may also be used.

Vaccine compositions may further incorporate additional substances to 
stabilize pH, or to function as adjuvants, wetting agents, or emulsifying 
agents, which can serve to improve the effectiveness of the vaccine. 

Vaccines are generally formulated for parenteral administration and are
injected either subcutaneously or intramuscularly. Such vaccines can also be
formulated as suppositories or for oral administration, using methods known in
the art.

The amount of vaccine sufficient to confer immunity to pathogenic
bacteria is determined by methods well known to those skilled in the art. This
quantity will be determined based upon the characteristics of the vaccine
recipient and the level of immunity required. Typically, the amount of vaccine
to be administered will be determined based upon the judgment of a skilled
physician. Where vaccines are administered by subcutaneous or intramuscular
injection, a range of 50 to 500 ng purified protein may be given.

The present invention is also directed to a vaccine in which a
polypeptide or active fragment of the present invention is delivered or
administered in the form of a polynucleotide encoding the polypeptide or active
fragment, whereby the polypeptide or active fragment is produced in vivo.
The polynucleotide may be included in a suitable expression vector and
combined with a pharmaceutically acceptable carrier.

In addition, the polypeptides of the present invention can be used as
immunogens to stimulate the production of antibodies for use in passive
immunotherapy, for use as diagnostic reagents, and for use as reagents in
other processes such as affinity chromatography.

In another aspect, the present invention provides polynucleotides which
encode the hereinabove described polypeptides and active fragments of the
invention. The polynucleotide of the present invention may be in the form of
RNA or in the form of DNA, which DNA includes cDNA, genomic DNA, and
synthetic DNA. The DNA may be double-stranded or single-stranded, and if
single-stranded may be the coding strand or non-coding (anti-sense) strand.
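
To illustrate the relationship between the coding strand and the non-coding (anti-sense) strand of a DNA, the following Python sketch derives one from the other by reverse complementation. The function name and the example sequences are assumptions made for this illustration, and only the four unambiguous DNA bases are handled.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(strand: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return strand.translate(_COMPLEMENT)[::-1]
```

Applying the function twice returns the original strand, so the same routine converts in either direction.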

In accordance with another aspect of the present invention, there is
provided

(A) an isolated polynucleotide that is at least 90% identical to a
polynucleotide sequence encoding (i) a polypeptide comprising amino acids
1-819 of SEQ ID NO:4 or (ii) a polypeptide comprising amino acids 1-460 of SEQ
ID NO:6, or

(B) a fragment of the polynucleotide of (A) that encodes an active
polypeptide fragment, or

(C) a polynucleotide that is at least 90% identical to a
polynucleotide sequence encoding an active fragment of (i) a polypeptide
comprising amino acids 1 to 800 of SEQ ID NO:8 or (ii) a polypeptide
comprising amino acids 1 to 800 of SEQ ID NO:10.

In specific embodiments, the polynucleotide is at least 95% identical, 
preferably at least 97% identical, and even 100% identical to such 
polynucleotide sequence. 

The term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well 
as a polynucleotide which includes additional coding and/or non-coding 
sequence. 

The present invention further relates to variants of polynucleotides. The 
variants of the polynucleotides may be a naturally occurring allelic variant of 
the polynucleotides or a non-naturally occurring variant of the polynucleotides. 
The variants include variants in which one or more bases are substituted, 
deleted or inserted. Complements to such coding polynucleotides may be 
utilized to isolate polynucleotides encoding the same or similar polypeptides. 
In particular, such procedures are useful to obtain native immunogenic portions 
of polypeptides from different serotypes of S. pneumoniae, which is especially 
useful in the production of "chain" polypeptide vaccines containing multiple
immunogenic segments. 

SEQ ID NO:5 is a representative example of a polynucleotide encoding
the polypeptide of SEQ ID NO:4 and SEQ ID NO:7 is a representative example
of a polynucleotide encoding the polypeptide of SEQ ID NO:6. SEQ ID NO:9 is
a representative example of a polynucleotide encoding the polypeptide of SEQ
ID NO:8, and SEQ ID NO:11 is a representative example of a polynucleotide
encoding the polypeptide of SEQ ID NO:10. As a result of the known
degeneracy of the genetic code, other polynucleotides that encode the
polypeptides of SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO:10
should be apparent to those skilled in the art from the teachings herein.
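
The degeneracy referred to above can be made concrete with a short Python sketch that counts how many distinct DNA coding sequences specify a given peptide under the standard genetic code. The names used and the example peptides are assumptions made for this illustration; the codon table is built from the conventional T, C, A, G ordering of the standard code.

```python
from itertools import product
from math import prod

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of synonymous codons available for each residue ("*" marks stop).
CODONS_PER_RESIDUE = {}
for _codon, _aa in CODON_TABLE.items():
    CODONS_PER_RESIDUE[_aa] = CODONS_PER_RESIDUE.get(_aa, 0) + 1


def count_encoding_sequences(peptide: str) -> int:
    """Number of distinct DNA sequences encoding the peptide (one-letter code)."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide.upper())
```

Even a six-residue motif of the HxxHxH form, such as the hypothetical peptide HGDHYH, can be encoded by many different polynucleotides; for a full-length polypeptide the count is astronomically large, which is why a single representative sequence is given for each polypeptide.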

The polynucleotides encoding the immunogenic polypeptides described
above may also have the coding sequence fused in frame to a marker
sequence which allows for purification of the polypeptides of the present
invention. The marker sequence may be, for example, a hexa-histidine tag
supplied by a pQE-9 vector to provide for purification of the mature
polypeptides fused to the marker in the case of a bacterial host, or, for
example, the marker sequence may be a hemagglutinin (HA) tag when a
mammalian host, e.g. COS-7 cells, is used. The HA tag corresponds to an
epitope derived from the influenza hemagglutinin protein (Wilson, I., et al., Cell,
37:767 (1984)).

The present invention also relates to vectors which include
polynucleotides encoding one or more of the polypeptides of the invention,
host cells which are genetically engineered with vectors of the invention and
the production of such immunogenic polypeptides by recombinant techniques
in an isolated and substantially immunogenically pure form.

Host cells are genetically engineered (transduced or transformed or
transfected) with the vectors comprising a polynucleotide encoding a
polypeptide of the invention. The vector may be, for example, in the form of a
plasmid, a viral particle, a phage, etc. The engineered host cells can be
cultured in conventional nutrient media modified as appropriate for activating
promoters, selecting transformants or amplifying the polynucleotides which
encode such polypeptides. The culture conditions, such as temperature, pH
and the like, are those previously used with the host cell selected for
expression, and will be apparent to the ordinarily skilled artisan.

Vectors include chromosomal, nonchromosomal and synthetic DNA
sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA;
baculovirus; yeast plasmids; vectors derived from combinations of plasmids
and phage DNA; and viral DNA such as vaccinia, adenovirus, fowl pox virus, and
pseudorabies. However, any other vector may be used as long as it is
replicable and viable in the host.

The appropriate DNA sequence may be inserted into the vector by a
variety of procedures. In general, the DNA sequence is inserted into an
appropriate restriction endonuclease site(s) by procedures known in the art.
Such procedures and others are deemed to be within the scope of those skilled
in the art.

The DNA sequence in the expression vector is operatively linked to an
appropriate expression control sequence(s) (promoter) to direct mRNA
synthesis. As representative examples of such promoters, there may be
mentioned: LTR or SV40 promoter, the E. coli lac or trp, the phage lambda PL
promoter and other promoters known to control expression of genes in
prokaryotic or eukaryotic cells or their viruses. The expression vector also
contains a ribosome binding site for translation initiation and a transcription
terminator. The vector may also include appropriate sequences for amplifying
expression.

In addition, the expression vectors preferably contain one or more 
selectable marker genes to provide a phenotypic trait for selection of 
transformed host cells such as dihydrofolate reductase or neomycin resistance 
for eukaryotic cell culture, or such as tetracycline or ampicillin resistance in E. 
coli. 

The vector containing the appropriate DNA sequence as hereinabove 
described, as well as an appropriate promoter or control sequence, may be 
employed to transform an appropriate host to permit the host to express the 
proteins. 

As representative examples of appropriate hosts, there may be
mentioned: bacterial cells, such as E. coli, Streptomyces, Salmonella
typhimurium; fungal cells, such as yeast; insect cells such as Drosophila S2
and Spodoptera Sf9; animal cells such as CHO, COS or Bowes melanoma;
adenoviruses; plant cells, etc. The selection of an appropriate host is deemed
to be within the scope of those skilled in the art from the teachings herein.

More particularly, the present invention also includes recombinant
constructs comprising one or more of the sequences as broadly described
above. The constructs comprise a vector, such as a plasmid or viral vector,
into which a sequence of the invention has been inserted, in a forward or
reverse orientation. In a preferred aspect of this embodiment, the construct
further comprises regulatory sequences, including, for example, a promoter,
operably linked to the sequence. Large numbers of suitable vectors and
promoters are known to those of skill in the art, and are commercially
available. The following vectors are provided by way of example. Bacterial:
pQE70, pQE60, pQE-9 (Qiagen, Inc.), pbs, pD10, phagescript, psiX174,
pbluescript SK, pbsks, pNH8A, pNH16a, pNH18A, pNH46A (Stratagene);
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic:
pWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene); pSVK3, pBPV, pMSG,
pSVL (Pharmacia). However, any other plasmid or vector may be used as long
as it is replicable and viable in the host.

Promoter regions can be selected from any desired gene using CAT
(chloramphenicol transferase) vectors or other vectors with selectable markers.
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial
promoters include lacI, lacZ, T3, T7, gpt, lambda PR, PL and trp. Eukaryotic
promoters include CMV immediate early, HSV thymidine kinase, early and late
SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the
appropriate vector and promoter is well within the level of ordinary skill in the
art.

In a further embodiment, the present invention relates to host cells
containing the above-described constructs. The host cell can be a higher
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
Introduction of the construct into the host cell can be effected by calcium
phosphate transfection, DEAE-Dextran mediated transfection, or
electroporation (Davis, L., Dibner, M., Battey, I., Basic Methods in Molecular
Biology, (1986)).

The constructs in host cells can be used in a conventional manner to 
produce the gene product encoded by the recombinant sequence.
Alternatively, the polypeptides of the invention can be synthetically produced 
by conventional peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, 
or other cells under the control of appropriate promoters. Cell-free translation 
systems can also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the present invention. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, 
Cold Spring Harbor, N.Y., (1989), the disclosure of which is hereby 
incorporated by reference. 

Transcription of the DNA encoding the polypeptides of the present
invention by higher eukaryotes is increased by inserting an enhancer sequence
into the vector. Enhancers are cis-acting elements of DNA, usually from about
10 to 300 bp, that act on a promoter to increase its transcription. Examples
include the SV40 enhancer on the late side of the replication origin (bp 100 to
270), a cytomegalovirus early promoter enhancer, the polyoma enhancer on the
late side of the replication origin, and adenovirus enhancers.

Generally, recombinant expression vectors will include origins of
replication and selectable markers permitting transformation of the host cell,
e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and
a promoter derived from a highly-expressed gene to direct transcription of a
downstream structural sequence. Such promoters can be derived from operons
encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK),
α-factor, acid phosphatase, or heat shock proteins, among others. The
heterologous structural sequence is assembled in appropriate phase with
translation initiation and termination sequences. Optionally, the heterologous
sequence can encode a fusion protein including an N-terminal identification
peptide imparting desired characteristics, e.g., stabilization or simplified
purification of expressed recombinant product.

Useful expression vectors for bacterial use are constructed by inserting
a structural DNA sequence encoding a desired protein together with suitable
translation initiation and termination signals in operable reading phase with a
functional promoter. The vector will comprise one or more phenotypic
selectable markers and an origin of replication to ensure maintenance of the
vector and to, if desirable, provide amplification within the host. Suitable
prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella
typhimurium and various species within the genera Pseudomonas,
Streptomyces, and Staphylococcus, although others may also be employed as
a matter of choice.

As a representative but nonlimiting example, useful expression vectors 
for bacterial use can comprise a selectable marker and bacterial origin of 
replication derived from commercially available plasmids comprising genetic 
elements of the well known cloning vector pBR322 (ATCC 37017). Such 
commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI, USA). 
These pBR322 "backbone" sections are combined with an appropriate 
promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected promoter is induced by 
appropriate means (e.g., temperature shift or chemical induction) and cells are 
cultured for an additional period. 
Cells are typically harvested by centrifugation, disrupted by physical or 
chemical means, and the resulting crude extract retained for further 
purification. 

Microbial cells employed in expression of proteins can be disrupted by 
any convenient method, including freeze-thaw cycling, sonication, a French 
press, mechanical disruption, or use of cell-lysing agents; such methods are 
well known to those skilled in the art. However, preferred are host cells which 
secrete the polypeptide of the invention and permit recovery of the polypeptide 
from the culture media. 

Various mammalian cell culture systems can also be employed to 
express recombinant protein. Examples of mammalian expression systems 
include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, 
Cell, 23:175 (1981), and other cell lines capable of expressing a compatible 
vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a suitable 
promoter and enhancer, and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. DNA 
sequences derived from the SV40 splice and polyadenylation sites may be 
used to provide the required nontranscribed genetic elements. 

The polypeptides can be recovered and/or purified from recombinant cell 
cultures by well-known protein recovery and purification methods. Such 
methodology may include ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. 
Protein refolding steps can be used, as necessary, in completing configuration 
of the mature protein. In this respect, chaperones may be used in such a 
refolding procedure. Finally, high performance liquid chromatography (HPLC) 
can be employed for final purification steps. 

The polypeptides that are useful as immunogens in the present invention 
may be a naturally purified product, or a product of chemical synthetic 
procedures, or produced by recombinant techniques from a prokaryotic or 
eukaryotic host (for example, by bacterial, yeast, higher plant, insect and 
mammalian cells in culture). Depending upon the host employed in a 
recombinant production procedure, the polypeptides of the present invention 
may be glycosylated or may be non-glycosylated. 

The individually expressed polypeptides 
may be isolated by recombinant expression/isolation methods that are well-known 
in the art. Typical examples for such isolation may utilize an antibody 
to a conserved area of the protein or to a His tag or cleavable leader or tail that 
is expressed as part of the protein structure. 

The polypeptides, their fragments or other derivatives, or analogs 
thereof, or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or 
monoclonal antibodies. The present invention also includes chimeric, single 
chain, and humanized antibodies, as well as Fab fragments, or the product of 
an Fab expression library. Various procedures known in the art may be used 
for the production of such antibodies and fragments. 

Antibodies generated against the polypeptides corresponding to a 
sequence of the present invention can be obtained by direct injection of the 
polypeptides into an animal. 

For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples 
include the hybridoma technique (Kohler and Milstein, 1975, Nature, 256:495- 
497), the trioma technique, the human B-cell hybridoma technique (Kozbor et 
al., 1983, Immunology Today 4:72), and the EBV-hybridoma technique to 
produce human monoclonal antibodies (Cole, et al., 1985, in Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. 
Patent 4,946,778) can be adapted to produce single chain antibodies to 
immunogenic polypeptide products of this invention. Also, transgenic mice 
may be used to express humanized antibodies to immunogenic polypeptide 
products of this invention. 

The invention will be further described with respect to the following 
examples; however, the scope of the invention is not limited thereby: 

Example 1 

Active Protection with Anti-Sp36 

A. Cloning, expression, and purification of Sp36 

The genomic DNA used as target for amplification was isolated from 
S. pneumoniae Norway strain (serotype 4), the same strain used for genomic 
sequencing. The complete sequence of the Sp36 gene (SEQ ID NO:9), and 
its predicted amino acid sequence (SEQ ID NO:8), are given in the Sequence 
Listing appended hereto. It was noted that the predicted amino acid 
sequence included a hydrophobic leader sequence followed by a sequence 
(LSVC) similar to the consensus sequence for Type II signal peptidase (LxxC, 
in which both x's typically represent small amino acids). Primers (listed as 
SEQ ID NOS:1-3) were designed that would amplify the Sp36 gene and allow 
its cloning into pQE10 and expression as a histidine-tagged protein lacking 
the signal sequence for purification by nickel-affinity chromatography. 
Cloning of the fragment amplified by SEQ ID NOS:1 and 3 would result in a 
protein containing amino acids 2 through 800 of Sp36; cloning of the 
fragment amplified by SEQ ID NOS:2 and 3 would result in a protein 
containing amino acids 7 through 800 of Sp36 (amino acid numbers refer to 
SEQ ID NO:8). 
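
The motif check described above (a leader sequence followed by an LxxC-like site) can be sketched in Python. This is an illustrative sketch only, not the method used by the applicants: the function name `find_lxxc_sites`, the toy sequence, and the set of residues treated as "small" (A, G, S, T, V) are all assumptions introduced here, and the toy sequence is not taken from SEQ ID NO:8.

```python
import re

# Assumed set of "small" residues; the text does not say which residues count.
SMALL_RESIDUES = set("AGSTV")


def find_lxxc_sites(protein: str):
    """Return (start_index, motif, both_x_small) for every L-x-x-C match.

    A lookahead is used so that overlapping matches are also reported.
    start_index is 0-based.
    """
    sites = []
    for m in re.finditer(r"(?=(L..C))", protein.upper()):
        motif = m.group(1)
        both_small = motif[1] in SMALL_RESIDUES and motif[2] in SMALL_RESIDUES
        sites.append((m.start(), motif, both_small))
    return sites


# Hypothetical toy leader sequence used only to exercise the function.
toy = "MKKTLLASLALSVCSYEL"
for start, motif, both_small in find_lxxc_sites(toy):
    print(start, motif, both_small)
```

A plain pattern match of this kind only flags candidate sites; whether a site is actually processed by a signal peptidase would have to be established experimentally.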

B. Active Protection With Sp36 Vaccination 

In each of the three experiments shown in Figures 1A-1C, C3H/HeJ 
mice (10/group) were immunized intraperitoneally (i.p.) with Sp36 protein 
(15 µg in 50 µl PBS emulsified in 50 µl complete Freund's adjuvant (CFA)). A 
group of 10 sham-immunized mice received PBS with adjuvant. A second 
immunization of 15 µg protein with incomplete Freund's adjuvant (IFA) was 
administered 4 weeks later; the sham group received PBS with IFA. Blood 
was drawn (retro-orbital bleed) at weeks 3, 6, and 9; and sera from each 
group were pooled for analysis of anti-Sp36 antibody by ELISA. Mice were 
challenged at week 10 by an i.p. injection of approximately 500 CFU S. 
pneumoniae strain SJ2 (serotype 6B; provided by P. Flynn, St. Jude 
Children's Research Hospital, Memphis, TN). In preliminary experiments, the 
LD50 of this strain was determined to be approximately 10 CFU. Mice were 
monitored for 14 days for survival. 
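
The relationship between the challenge dose and the LD50 stated above can be made explicit with a short calculation. The sketch below is illustrative only; the function name `challenge_in_ld50_units` is an assumption introduced here, and the inputs are the approximate values given in the text (about 500 CFU challenge, LD50 of about 10 CFU).

```python
def challenge_in_ld50_units(challenge_cfu: float, ld50_cfu: float) -> float:
    """Express a challenge dose as a multiple of the LD50."""
    if ld50_cfu <= 0:
        raise ValueError("LD50 must be positive")
    return challenge_cfu / ld50_cfu


# Approximate values stated in the text: ~500 CFU challenge, LD50 ~10 CFU.
print(challenge_in_ld50_units(500, 10))
```

On these approximate figures the challenge corresponds to roughly fifty times the LD50.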

The three experiments shown in Figures 1A-1C used slightly different 
preparations of recombinant Sp36. The experiments shown in Figures 1A and 
1B both used Sp36 containing amino acids 20-815, but different batches of 
protein were used in the two experiments. The experiment shown in Figure 
1C used Sp36 containing amino acids 25-815. 


In the experiment shown in Figure 1A, 9-week sera collected from the 
ten mice immunized with Sp36 (first batch) had an endpoint ELISA titer of 
1:4,096,000. No anti-Sp36 antibody was detected in sera from sham- 
immunized mice. One hundred percent of the mice immunized with Sp36 
protein survived the challenge (520 cfu of pneumococci) for 14 days. Eighty 
percent of sham-immunized mice were dead by day 4, and the remainder 
survived. 

In the experiment shown in Figure 1B, 9-week sera collected from the 
ten mice immunized with Sp36 (second batch) had an endpoint ELISA titer of 
>1:4,096,000. No anti-Sp36 antibody was detected in sera from sham- 
immunized mice. One hundred percent of the mice immunized with Sp36 
protein survived the challenge (510 cfu of pneumococci) for 14 days. Of the 
sham-immunized mice, eighty percent were dead by day 4, and all died by 
day 9. 

In the experiment shown in Figure 1C, 9-week sera collected from the 
ten mice immunized with Sp36 (containing amino acids 25-815) had an 
endpoint ELISA titer of 1:4,096,000. No anti-Sp36 antibody was detected in 
sera from sham-immunized mice. One hundred percent of the mice 
immunized with Sp36 protein survived the challenge (510 cfu of 
pneumococci) for 14 days. Of the sham-immunized mice, ninety percent died 
by day 4, and all died by day 12. These data demonstrate that immunization 
of mice with recombinant Sp36 proteins elicits a response capable of 
protecting against systemic pneumococcal infection and death. This 
protection was not strain-specific: the recombinant pneumococcal protein 
was cloned from a serotype 4 strain, while the challenge was with a 
heterologous strain, SJ2 (serotype 6B). 

Example 2 

Passive Protection with Anti-Sp36 Antisera 

A. Generation of Rabbit Immune Sera 

Following collection of preimmune serum, a New Zealand White rabbit 
was immunized with 250 µg of Sp36 (containing amino acids 20-815) in 
CFA. The rabbit was given two boosts of 125 µg Sp36 in IFA on days 29 
and 50 and bled on days 39 and 60. A second rabbit was immunized with a 
control antigen, E. coli FimC. 

B. Passive Protection in Mice 

C3H/HeJ mice (10 mice/group) were passively immunized by two i.p. 
injections of 100 µl of rabbit serum. The first injection was administered 
twenty-four hours before challenge with 172 cfu of S. pneumoniae strain 
SJ2, and the second injection was given four hours after challenge. Figure 2 
shows the survival of mice after infection with two different strains of 
pneumococci. 

Figure 2A shows that, of mice injected with 172 cfu of strain SJ2, 
one hundred percent of the mice immunized with rabbit immune 
serum raised against Sp36 protein survived the 21-day observation period. 
Of the mice immunized with the control serum (anti-FimC), eighty percent 
died by day 8, and ninety percent died by day 12. Figure 2B shows that, of 
mice injected with 862 cfu of strain EF6796, ninety percent of the mice 
immunized with rabbit immune serum raised against Sp36 protein survived 
the 8-day observation period. Of those given a control serum (collected from 
a rabbit before immunization), ninety percent died by day 8. 

These data indicate that the protection against pneumococcal infection 
resulting from immunization with Sp36 is antibody-mediated, since mice can 
be protected by passive transfer of serum from a hyperimmunized rabbit. As 
seen in the mouse active challenge experiments described above, serum 
directed against recombinant Sp36 protein cloned from a serotype 4 strain 
was protective against challenge with heterologous strains. 



Example 3 

Conservation of Sp36 Among Strains of S. pneumoniae 



A. Western blotting 



The 23 pneumococcal strains used in this experiment were obtained 
from the American Type Culture Collection (Rockville, MD) and include one 
isolate each of the 23 serotypes in the multivalent pneumococcal vaccine. 
For total cell lysates, pneumococci were grown to mid-logarithmic phase 
(optical density at 620 nm, 0.4 to 0.6) in 2 ml Todd-Hewitt broth with 0.5% 
yeast extract (Difco, Detroit, MI) at 37°C. Bacteria were harvested by 
centrifugation and washed twice with water. Pellets were resuspended in 
200 µl lysis buffer (0.01% sodium dodecyl sulfate, 0.15 M sodium citrate 
and 0.1% sodium deoxycholate) and incubated at 37°C for 30 min, then 
diluted in an equal volume of 2x SSC (0.3 M sodium chloride, 0.03 M sodium 
citrate). Lysates were separated by SDS-PAGE, transferred to nitrocellulose 
membranes (Bio-Rad Laboratories, Hercules, CA), and probed with antibody 
in a standard Western blotting procedure. Sera from ten C3H/HeJ mice 
immunized with Sp36 (as described in Example 1) were pooled and used at a 
dilution of 1:3000. Bound antibody was detected with peroxidase-conjugated 
sheep anti-mouse IgG using the chemiluminescence kit from 
Amersham, Inc. (Cambridge, MA). 

The mouse anti-Sp36 sera detected two major bands with apparent 
molecular weights of 97 and 100 kDa in all 23 pneumococcal lysates tested 
(shown in Figure 3). The Sp36 signals obtained from S. pneumoniae 
serotypes 1, 5, 17F and 22F were lower, indicating either that the level of 
Sp36 expression is reduced in these strains, or that Sp36 in these strains is 
antigenically different. 

These data show that Sp36 is antigenically conserved among strains 
of the 23 pneumococcal serotypes represented in the current polysaccharide 
vaccine. 



B. Southern blotting 

Genomic DNA was prepared from each of the 23 pneumococcal 
strains listed in the previous section and also from strain SJ2. DNA was 
digested with PvuII and BamHI, electrophoresed in an agarose gel and 
transferred to a nylon membrane. A probe was prepared by amplifying the 
Sp36 gene from Norway type 4 DNA (as in Example 1) and labeling the 
amplified fragment with fluorescein by the random-priming method, using a 
kit from Amersham. Hybridization, washing, and exposure of film were 
carried out as in the protocol supplied by Amersham. Figure 4 shows that 
the Sp36 probe hybridized with DNA from each of the 24 strains studied. 
The lane marked "M" contained DNA from lambda phage, digested with 
HindIII and labeled with fluorescein, as molecular weight markers. 



Example 4 

Immunogenicity of Sp36 in Humans 

In order to determine whether Sp36 is immunogenic during human 
pneumococcal infection, sera from patients with culture-proven 
pneumococcal bacteremia were used in Western blots containing 
recombinant Sp36 protein. In the experiment shown in Figure 5, sera from 
five patients (indicated as 1 through 5) were diluted 1:3000 and used to 
probe blots containing full-length Sp36, the N-terminal half of Sp36 
(preceding the proline-rich region), or the C-terminal half of Sp36 (following 
the proline-rich region). Lanes labeled A (acute) were probed with serum 
collected shortly after diagnosis of pneumococcal infection; lanes C 
(convalescent) were probed with serum collected either one month later 
(patients 1, 2, and 3) or eight days after the first serum collection (patients 4 
and 5). For patients 2, 3 and 5, reactivity of the convalescent serum with 
Sp36 was stronger than that of the corresponding acute serum. The 
difference between the acute and convalescent sera was particularly evident 
for reactivity with the C-terminal half of the protein. 

In additional experiments (not shown), convalescent sera from 23 
patients with pneumococcal infections were tested individually for reactivity 
with full-length Sp36: 20 of the 23 sera were found to bind Sp36 on a 
Western blot. 

These experiments indicate that Sp36 is recognized by the human 
immune system and suggest that antibodies able to bind the Sp36 protein 
may be produced during natural S. pneumoniae infection in humans. Since 
the patients were infected with a variety of pneumococcal strains, these data 
also support the idea that Sp36 is antigenically conserved. 



Example 5 

Table 1 provides the percent identity between the various sequences. 

Alignment of the predicted amino acid sequences of PhtA, PhtB, PhtD, 
and PhtE using the MEGALIGN program of Lasergene showed strong N- 
terminal homology with substantial divergence of the C-termini (Figure 6). 
The alignment of the nucleotide sequences of the same genes is shown in 
Figure 7. Amino acid and nucleotide sequences were compared using the 
identity weighting in a Lipman-Pearson pairwise alignment, in which the 
number of matching residues is divided by the total of matching residues plus 
the number of mismatched residues plus the number of residues in gaps. In 
the table below, the percent identity between each pair of sequences is 
shown at the intersection of the corresponding row and column. 
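
The identity weighting described above (matching residues divided by matching residues plus mismatched residues plus residues in gaps) can be sketched in Python. This is an illustrative sketch, not the MEGALIGN implementation: it assumes the two sequences have already been aligned and padded with a gap character, it does not perform the Lipman-Pearson alignment itself, and it interprets "residues in gaps" as residues aligned opposite a gap. The function name `percent_identity` and the toy sequences are assumptions introduced here.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned, equal-length sequences.

    identity = matches / (matches + mismatches + residues opposite a gap)
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    matches = mismatches = gap_residues = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # column with no residue in either sequence
        if a == gap or b == gap:
            gap_residues += 1  # one residue aligned opposite a gap
        elif a == b:
            matches += 1
        else:
            mismatches += 1

    total = matches + mismatches + gap_residues
    if total == 0:
        raise ValueError("no residues to compare")
    return 100.0 * matches / total


# Hypothetical toy alignment: 5 matches, 1 mismatch, 1 gapped residue.
print(round(percent_identity("HGDHYHY", "HGNHY-Y"), 1))
```

Because gapped residues are counted in the denominator, sequences that diverge in length (as the C-termini of these proteins do) score lower than they would under a measure that ignores gaps.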

Example 6 

Active Protection with PhtD Vaccination. 

Mice immunized with recombinant PhtD derived from strain N4 
generated potent antibody titers (reciprocal endpoint titers ranging from 
2,048,000 to 4,096,000). Mice immunized with PhtD were protected against 
death following intraperitoneal injection with any of three heterologous 
strains, SJ2 (serotype 6B; provided by P. Flynn, St. Jude Children's Research 
Hospital, Memphis, TN), EF6796 (serotype 6A) or EF5668 (serotype 4; both 
strains provided by D. Briles, University of Alabama, Birmingham). In the 
experiment shown in Figure 8 (Panel A), all ten of the sham-immunized mice 
died within 10 days after challenge with virulent pneumococci (strain SJ2), 
while eighty percent of the PhtD-immunized mice survived the 15-day 
observation period. Immunization with PhtD also protected against a serotype 
6A strain, EF6796 (Panel B) and a serotype 4 strain, EF5668 (Panel C). In 
the experiment shown in Figure 8 (Panel B), all ten of the sham-immunized 
mice died within 7 days after challenge with virulent pneumococci (strain 
EF6796), while ninety percent of the PhtD-immunized mice survived the 15- 
day observation period. In the experiment shown in Figure 8 (Panel C), all ten 
of the sham-immunized mice died within 6 days after challenge with virulent 
pneumococci (strain EF5668), while eight of nine mice immunized with PhtD 
survived the 15-day observation period. 

Table 1. Percent Identities 

Percent Identity Between Amino Acid Sequences 

        PhtA    PhtB    PhtD    PhtE 
PhtA    —       66.4    63.9    49.5 
PhtB            —       87.2    49.5 
PhtD                    —       49.8 
PhtE                            — 

Percent Identity Between Nucleotide Sequences 

        PhtA    PhtB    PhtD    PhtE 
PhtA    —       58.3    59.3    47.9 
PhtB            —       86.4    47.4 
PhtD                    —       47.9 
PhtE                            — 




WO 00/37105 , PCT/US99/30390 



WHAT IS CLAIMED IS: 

1. A vaccine composition comprising: 

(a) at least one member selected from the group consisting 
of (i) a polypeptide comprising a polypeptide sequence that is at least 90% 
identical to amino acids 1-819 of SEQ ID NO:4; (ii) a polypeptide comprising 
a polypeptide sequence that is at least 90% identical to amino acids 1-460 
of SEQ ID NO:6; (iii) a fragment of the polypeptide of (i) that includes at least 
one of a histidine triad residue or coiled-coil region; (iv) a fragment of the 
polypeptide of (ii) that includes at least one of a histidine triad residue or a 
coiled-coil region; (v) a fragment of a polypeptide that is at least 90% 
identical to the polypeptide sequence comprising amino acids 1-800 of SEQ 
ID NO:8, wherein said fragment includes at least one of a histidine triad 
residue or coiled-coil region wherein said fragment includes at least 80 amino 
acids and no more than 680 amino acids; and (vi) a fragment of a 
polypeptide that is at least 90% identical to the polypeptide sequence 
comprising amino acids 1-800 of SEQ ID NO:10, wherein said fragment 
includes at least one of a histidine triad residue or coiled-coil region wherein 
said fragment includes at least 80 amino acids and no more than 680 amino 
acids; and 

(b) a pharmaceutically acceptable carrier. 

2. A process for preventing infection caused by S. pneumoniae 
comprising: 

administering the vaccine of claim 1. 

3. A vaccine composition comprising: 

(a) at least one antibody against a member selected from the 
group consisting of (i) a polypeptide comprising a polypeptide sequence that 
is at least 90% identical to amino acids 1-819 of SEQ ID NO:4; (ii) a 
polypeptide comprising a polypeptide sequence that is at least 90% identical 
to amino acids 1-460 of SEQ ID NO:6; (iii) a fragment of the polypeptide of 
(i) that includes at least one of histidine triad residue or coiled-coil region; (iv) 
a fragment of the polypeptide of (ii) that includes at least one of a histidine 
triad residue or a coiled-coil region; (v) a fragment of a polypeptide that is at 
least 90% identical to the polypeptide sequence comprising amino acids 1- 
800 of SEQ ID NO:8, wherein said fragment includes at least one of a 
histidine triad residue or coiled-coil region wherein said fragment includes at 
least 80 amino acids and no more than 680 amino acids and (vi) a fragment 
of a polypeptide that is at least 90% identical to the polypeptide sequence 
comprising amino acids 1-800 of SEQ ID NO: 10, wherein said fragment 
includes at least one of a histidine triad residue or coiled-coil region wherein 
said fragment includes at least 80 amino acids and no more than 680 amino 
acids. 

Figure 1 

Figure 2 

Figure 6 (a) 

[Alignment of the predicted amino acid sequences of PhtA, PhtB, PhtD, and PhtE.] 

Figure 7 

[Alignment of the nucleotide sequences of phtA, phtB, phtD, and phtE against the majority (consensus) sequence, alignment positions 1 to 1050.] 




Figure 7 (d) 



ACTCCACAATCGACTCCGOA A CCTAGTCCAACTCCOCAAC CTGCACCAAA Majority 
{ | I I I 

1060 1070 1080 1090 1100 

* 1 1 ' 

[Figure 7, continued: nucleotide sequence alignment of phtA, phtB, phtD and phtE against the majority consensus, through alignment position 1400.]
Figure 7 (e)

[Nucleotide sequence alignment of phtA, phtB, phtD and phtE against the majority consensus, alignment positions 1410-1750.]
Figure 7 (f)

[Nucleotide sequence alignment of phtA, phtB, phtD and phtE against the majority consensus, alignment positions 1760-2100.]
Figure 7 (g)

[Nucleotide sequence alignment of phtA, phtB, phtD and phtE against the majority consensus, alignment positions 2110-2450.]
Figure 7 (h)

[Nucleotide sequence alignment of phtA, phtB, phtD and phtE against the majority consensus, alignment position 2460 to the end of the alignment.]
Figure 8



Strain SJ2 (serotype 6B) 
SEQUENCE LISTING

<110> Johnson, Leslie S. 
Koenig, Scott 
Adamou, John E. 

<120> Streptococcus Pneumoniae and Immunogenic Fragments for 
Vaccines 

<130> 469201-444 

<140> 
<141> 

<150> 60/113,048 
<151> 1998-12-21 

<160> 11 

<170> PatentIn Ver. 2.0

<210> 1 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Forward primer 
used in amplification of the Sp36 gene sequence. 

<400> 1 

atcggatcct tcttacgagt tgggactgta tcaagc 

<210> 2 
<211> 35
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Forward primer 
used in amplification of the Sp36 gene sequence. 

<400> 2 

atcggatcca ctgtatcaag ctagaacggt taagg 

<210> 3 
<211> 40 
<212> DNA 
<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence : Reverse primer 
used in amplification of the Sp36 gene sequence.

<400> 3 

agtcaagctt gtttattttt tccttactta cagatgaagg 40 
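
The relationship between the reverse primer of SEQ ID NO: 3 and the coding sequences in this listing can be checked by taking its reverse complement. The sketch below is illustrative only and is not part of the original listing. It assumes, from inspection of the listing rather than from any statement in it, that the primer was designed against the 3' end of SEQ ID NO: 9; the names reverse_complement, REVERSE_PRIMER and SEQ9_TAIL are hypothetical helpers introduced here.

```python
# Illustrative sketch only; not part of the original sequence listing.

COMPLEMENT = {"a": "t", "t": "a", "g": "c", "c": "g"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string, ignoring spaces."""
    bases = seq.replace(" ", "").lower()
    return "".join(COMPLEMENT[b] for b in reversed(bases))


# SEQ ID NO: 3 (reverse primer), copied from the listing.
REVERSE_PRIMER = "agtcaagctt gtttattttt tccttactta cagatgaagg"

# Final 51 nucleotides of SEQ ID NO: 9, copied from the listing.
# Assumption: this is the region the primer was designed against.
SEQ9_TAIL = "ttgttaaaag gaagtaatcc ttcatctgta agtaaggaaa aaataaacta a".replace(" ", "")

rc = reverse_complement(REVERSE_PRIMER)

# Length of the longest prefix of the reverse complement found in the tail.
match_len = max(n for n in range(len(rc) + 1) if rc[:n] in SEQ9_TAIL)

print(rc)
print(match_len)
```

Under these assumptions, the first 30 nucleotides of the reverse complement match the listed sequence immediately upstream of its final three nucleotides, and the remaining 10 nucleotides correspond to the non-annealing 5' extension of the primer.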

<210> 4 
<211> 838 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 4 

Met Lys Ile Asn Lys Lys Tyr Leu Ala Gly Ser Val Ala Val Leu Ala
1 5 10 15

Leu Ser Val Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val
20 25 30

Lys Lys Glu Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly
35 40 45

Gln Lys Ala Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly
50 55 60

Ile Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80

Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95

Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110

Leu Lys Asp Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile
115 120 125

Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140

Asp Asn Ile Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His
145 150 155 160

Ser His Asn His Gly Gly Gly Ser Asn Asp Gln Ala Val Val Ala Ala
165 170 175
Arg Ala Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala
180 185 190

Ser Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly
195 200 205

Asp His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu
210 215 220

Ala Ala Ala Glu Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser
225 230 235 240

Ser Ser Ser Ser Tyr Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu
245 250 255

Asn His Asn Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu
260 265 270

Asn Ile Ser Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu
275 280 285

Arg His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr
290 295 300

Ser Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His
305 310 315 320

Phe Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg
325 330 335

Ile Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg
340 345 350

Pro Glu Gln Pro Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Pro
355 360 365

Gln Pro Ala Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu
370 375 380

Lys Leu Val Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe
385 390 395 400

Glu Glu Asn Gly Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala
405 410 415

Glu Thr Ala Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu
420 425 430
Ser His Lys Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg
435 440 445

Glu Phe Tyr Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp
450 455 460

Leu Leu Asp Asn Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn
465 470 475 480

Leu Leu Glu Arg Leu Lys Asp Val Pro Ser Asp Lys Val Lys Leu Val
485 490 495

Asp Asp Ile Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu
500 505 510

Gly Lys Pro Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val
515 520 525

Ala Lys Leu Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp
530 535 540

Pro Arg Asp Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His
545 550 555 560

Met Thr His Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu
565 570 575

Arg Ala Ala Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro
580 585 590

Ser Thr Asp His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu
595 600 605

Ala Ile Tyr Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg
610 615 620

Met Pro Tyr Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu
625 630 635 640

Ile Ile Pro His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe
645 650 655

Asp Glu Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu
660 665 670

Leu Ala Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His
675 680 685
Ser Asp Asn Gly Phe Gly Asn Ala Ser Asp His Val Arg Lys Asn Lys
690 695 700

Val Asp Gln Asp Ser Lys Pro Asp Glu Asp Lys Glu His Asp Glu Val
705 710 715 720

Ser Glu Pro Thr His Pro Glu Ser Asp Glu Lys Glu Asn His Ala Gly
725 730 735

Leu Asn Pro Ser Ala Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr Glu
740 745 750

Glu Thr Glu Glu Glu Ala Glu Asp Thr Thr Asp Glu Ala Glu Ile Pro
755 760 765

Gln Val Glu Asn Ser Val Ile Asn Ala Lys Ile Ala Asp Ala Glu Ala
770 775 780

Leu Leu Glu Lys Val Thr Asp Pro Ser Ile Arg Gln Asn Ala Met Glu
785 790 795 800

Thr Leu Thr Gly Leu Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp Asn
805 810 815

Asn Thr Ile Ser Ala Glu Val Asp Ser Leu Leu Ala Leu Leu Lys Glu
820 825 830

Ser Gln Pro Ala Pro Ile
835



<210> 5 
<211> 2531 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 5 

atgaaaatta ataaaaaata tctagcaggt tcagtggcag tccttgccct aagtgtttgt 60
tcctatgaac ttggtcgtca ccaagctggt caggttaaga aagagtctaa tcgagtttct 120
tatatagatg gtgatcaggc tggtcaaaag gcagaaaact tgacaccaga tgaagtcagt 180
aagagggagg ggatcaacgc cgaacaaatc gtcatcaaga ttacggatca aggttatgtg 240
acctctcatg gagaccatta tcattactat aatggcaagg tcccttatga tgccatcatc 300
agtgaagagc tcctcatgaa agatccgaat tatcagttga aggattcaga cattgtcaat 360
gaaatcaagg gtggttatgt tatcaaggta gatggaaaat actatgttta ccttaaggat 420
gcagctcatg cggataatat tcggacaaaa gaagagatta aacgtcagaa gcaggaacac 480
agtcataatc acgggggtgg ttctaacgat caagcagtag ttgcagccag agcccaagga 540
cgctatacaa cggatgatgg ttatatcttc aatgcatctg atatcattga ggacacgggt 600
gatgcttata tcgttcctca cggcgaccat taccattaca ttcctaagaa tgagttatca 660
gctagcgagt tagctgctgc agaagcctat tggaatggga agcagggatc tcgtccttct 720
tcaagttcta gttataatgc aaatccagct caaccaagat tgtcagagaa ccacaatctg 780
actgtcactc caacttatca tcaaaatcaa ggggaaaaca tttcaagcct tttacgtgaa 840
ttgtatgcta aacccttatc agaacgccat gtggaatctg atggccttat tttcgaccca 900
gcgcaaatca caagtcgaac cgccagaggt gtagctgtcc ctcatggtaa ccattaccac 960
tttatccctt atgaacaaat gtctgaattg gaaaaacgaa ttgctcgtat tattcccctt 1020
cgttatcgtt caaaccattg ggtaccagat tcaagaccag aacaaccaag tccacaatcg 1080
actccggaac ctagtccaag tccgcaacct gcaccaaatc ctcaaccagc tccaagcaat 1140
ccaattgatg agaaattggt caaagaagct gttcgaaaag taggcgatgg ttatgtcttt 1200
gaggagaatg gagtttctcg ttatatccca gccaaggatc tttcagcaga aacagcagca 1260
ggcattgata gcaaactggc caagcaggaa agtttatctc ataagctagg agctaagaaa 1320
actgacctcc catctagtga tcgagaattt tacaataagg cttatgactt actagcaaga 1380
attcaccaag atttacttga taataaaggt cgacaagttg attttgaggc tttggataac 1440
ctgttggaac gactcaagga tgtcccaagt gataaagtca agttagtgga tgatattctt 1500
gccttcttag ctccgattcg tcatccagaa cgtttaggaa aaccaaatgc gcaaattacc 1560
tacactgatg atgagattca agtagccaag ttggcaggca agtacacaac agaagacggt 1620
tatatctttg atcctcgtga tataaccagt gatgaggggg atgcctatgt aactccacat 1680
atgacccata gccactggat taaaaaagat agtttgtctg aagctgagag agcggcagcc 1740
caggcttatg ctaaagagaa aggtttgacc cctccttcga cagaccatca ggattcagga 1800
aatactgagg caaaaggagc agaagctatc tacaaccgcg tgaaagcagc taagaaggtg 1860
ccacttgatc gtatgcctta caatcttcaa tatactgtag aagtcaaaaa cggtagttta 1920
atcatacctc attatgacca ttaccataac atcaaatttg agtggtttga cgaaggcctt 1980
tatgaggcac ctaaggggta tactcttgag gatcttttgg cgactgtcaa gtactatgtc 2040
gaacatccaa acgaacgtcc gcattcagat aatggttttg gtaacgctag cgaccatgtt 2100
cgtaaaaata aggtagacca agacagtaaa cctgatgaag ataaggaaca tgatgaagta 2160
agtgagccaa ctcaccctga atctgatgaa aaagagaatc acgctggttt aaatccttca 2220
gcagataatc tttataaacc aagcactgat acggaagaga cagaggaaga agctgaagat 2280
accacagatg aggctgaaat tcctcaagta gagaattctg ttattaacgc taagatagca 2340
gatgcggagg ccttgctaga aaaagtaaca gatcctagta ttagacaaaa tgctatggag 2400
acattgactg gtctaaaaag tagtcttctt ctcggaacga aagataataa cactatttca 2460
gcagaagtag atagtctctt ggctttgtta aaagaaagtc aaccggctcc tatatagtaa 2520
aagcttaagc c 2531



<210> 6 

<211> 484 

<212> PRT 

<213> Streptococcus pneumoniae 



<400> 6 

Met Lys Phe Ser Lys Lys Tyr Ile Ala Ala Gly Ser Ala Val Ile Val
1 5 10 15

Ser Leu Ser Leu Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu
20 25 30

Asn Lys Asp Asn Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser
35 40 45



WO 00/37105 



PCT/US99/30390



Gln Lys Ser Glu Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly
50 55 60

Ile Gln Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80

Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95

Asp Ala Leu Phe Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110

Leu Lys Asp Ala Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile
115 120 125

Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140

Asp Asn Val Arg Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His
145 150 155 160

Val Lys Asp Asn Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser
165 170 175

Gln Gly Arg Tyr Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp
180 185 190

Ile Ile Glu Asp Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His
195 200 205

Tyr His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala
210 215 220

Ala Lys Ala His Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser
225 230 235 240

Tyr Ser Ser Thr Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly
245 250 255

Ser Thr Ser Lys Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu
260 265 270

Lys Glu Leu Tyr Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp
275 280 285

Gly Leu Val Phe Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly
290 295 300
Val Ala Ile Pro His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys
305 310 315 320

Leu Ser Ala Leu Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly
325 330 335

Thr Gly Ser Thr Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser
340 345 350

Ser Leu Gly Ser Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys
355 360 365

Glu Leu Ser Ser Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile
370 375 380

Val Glu Glu Thr Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe
385 390 395 400

His Tyr Ile Pro Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn
405 410 415

Asn Ser Leu Ala Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr
420 425 430

Ser His Glu Lys His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg
435 440 445

Ile Ile Ala Glu Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His
450 455 460

Asn His Tyr Phe Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Val
465 470 475 480

Arg Lys Asn Ile



<210> 7 
<211> 1455 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 7 

atgaaattta gtaaaaaata tatagcagct ggatcagctg ttatcgtatc cttgagtcta 60
tgtgcctatg cactaaacca gcatcgttcg caggaaaata aggacaataa tcgtgtctct 120
tatgtggatg gcagccagtc aagtcagaaa agtgaaaact tgacaccaga ccaggttagc 180
cagaaagaag gaattcaggc tgagcaaatt gtaatcaaaa ttacagatca gggctatgta 240
acgtcacacg gtgaccacta tcattactat aatgggaaag ttccttatga tgccctcttt 300
agtgaagaac tcttgatgaa ggatccaaac tatcaactta aagacgctga tattgtcaat 360
gaagtcaagg gtggttatat catcaaggtc gatggaaaat attatgtcta cctgaaagat 420
gcagctcatg ctgataatgt tcgaactaaa gatgaaatca atcgtcaaaa acaagaacat 480
gtcaaagata atgagaaggt taactctaat gttgctgtag caaggtctca gggacgatat 540
acgacaaatg atggttatgt ctttaatcca gctgatatta tcgaagatac gggtaatgct 600
tatatcgttc ctcatggagg tcactatcac tacattccca aaagcgattt atctgctagt 660
gaattagcag cagctaaagc acatctggct ggaaaaaata tgcaaccgag tcagttaagc 720
tattcttcaa cagctagtga caataacacg caatctgtag caaaaggatc aactagcaag 780
ccagcaaata aatctgaaaa tctccagagt cttttgaagg aactctatga ttcacctagc 840
gcccaacgtt acagtgaatc agatggcctg gtctttgacc ctgctaagat tatcagtcgt 900
acaccaaatg gagttgcgat tccgcatggc gaccattacc actttattcc ttacagcaag 960
ctttctgcct tagaagaaaa gattgccaga atggtgccta tcagtggaac tggttctaca 1020
gtttctacaa atgcaaaacc taatgaagta gtgtctagtc taggcagtct ttcaagcaat 1080
ccttcttctt taacgacaag taaggagctc tcttcagcat ctgatggtta tatttttaat 1140
ccaaaagata tcgttgaaga aacggctaca gcttatattg taagacatgg tgatcatttc 1200
cattacattc caaaatcaaa tcaaattggg caaccgactc ttccaaacaa tagtctagca 1260
acaccttctc catctcttcc aatcaatcca ggaacttcac atgagaaaca tgaagaagat 1320
ggatacggat ttgatgctaa tcgtattatc gctgaagatg aatcaggttt tgtcatgagt 1380
cacggagacc acaatcatta tttcttcaag aaggacttga cagaagagca aattaaggtg 1440
cgcaaaaaca tttag 1455

<210> 8 
<211> 819 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 8 

Met Lys Ile Asn Lys Lys Tyr Leu Val Gly Ser Ala Ala Ala Leu Ile
1 5 10 15

Leu Ser Val Cys Ser Tyr Glu Leu Gly Leu Tyr Gln Ala Arg Thr Val
20 25 30

Lys Glu Asn Asn Arg Val Ser Tyr Ile Asp Gly Lys Gln Ala Thr Gln
35 40 45

Lys Thr Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile
50 55 60

Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr
65 70 75 80

Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp
85 90 95

Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Lys Leu
100 105 110
Lys Asp Glu Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Val Ile Lys
115 120 125

Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp
130 135 140

Asn Val Arg Thr Lys Glu Glu Ile Asn Arg Gln Lys Gln Glu His Ser
145 150 155 160

Gln His Arg Glu Gly Gly Thr Pro Arg Asn Asp Gly Ala Val Ala Leu
165 170 175

Ala Arg Ser Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn
180 185 190

Ala Ser Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His
195 200 205

Gly Asp His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu
210 215 220

Leu Ala Ala Ala Glu Ala Phe Leu Ser Gly Arg Gly Asn Leu Ser Asn
225 230 235 240

Ser Arg Thr Tyr Arg Arg Gln Asn Ser Asp Asn Thr Ser Arg Thr Asn
245 250 255

Trp Val Pro Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser
260 265 270

Asn Asn Ser Asn Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp
275 280 285

Ser Leu Leu Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val
290 295 300

Glu Ser Asp Gly Leu Val Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
305 310 315 320

Ala Arg Gly Val Ala Val Pro His Gly Asp His Tyr His Phe Ile Pro
325 330 335

Tyr Ser Gln Met Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro
340 345 350

Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
355 360 365
Pro Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Gly Pro Gln Pro Ala
370 375 380

Pro Asn Leu Lys Ile Asp Ser Asn Ser Ser Leu Val Ser Gln Leu Val
385 390 395 400

Arg Lys Val Gly Glu Gly Tyr Val Phe Glu Glu Lys Gly Ile Ser Arg
405 410 415

Tyr Val Phe Ala Lys Asp Leu Pro Ser Glu Thr Val Lys Asn Leu Glu
420 425 430

Ser Lys Leu Ser Lys Gln Glu Ser Val Ser His Thr Leu Thr Ala Lys
435 440 445

Lys Glu Asn Val Ala Pro Arg Asp Gln Glu Phe Tyr Asp Lys Ala Tyr
450 455 460

Asn Leu Leu Thr Glu Ala His Lys Ala Leu Phe Glu Asn Lys Gly Arg
465 470 475 480

Asn Ser Asp Phe Gln Ala Leu Asp Lys Leu Leu Glu Arg Leu Asn Asp
485 490 495

Glu Ser Thr Asn Lys Glu Lys Leu Val Asp Asp Leu Leu Ala Phe Leu
500 505 510

Ala Pro Ile Thr His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gln Ile
515 520 525

Glu Tyr Thr Glu Asp Glu Val Arg Ile Ala Gln Leu Ala Asp Lys Tyr
530 535 540

Thr Thr Ser Asp Gly Tyr Ile Phe Asp Glu His Asp Ile Ile Ser Asp
545 550 555 560

Glu Gly Asp Ala Tyr Val Thr Pro His Met Gly His Ser His Trp Ile
565 570 575

Gly Lys Asp Ser Leu Ser Asp Lys Glu Lys Val Ala Ala Gln Ala Tyr
580 585 590

Thr Lys Glu Lys Gly Ile Leu Pro Pro Ser Pro Asp Ala Asp Val Lys
595 600 605

Ala Asn Pro Thr Gly Asp Ser Ala Ala Ala Ile Tyr Asn Arg Val Lys
610 615 620
Gly Glu Lys Arg Ile Pro Leu Val Arg Leu Pro Tyr Met Val Glu His
625 630 635 640

Thr Val Glu Val Lys Asn Gly Asn Leu Ile Ile Pro His Lys Asp His
645 650 655

Tyr His Asn Ile Lys Phe Ala Trp Phe Asp Asp His Thr Tyr Lys Ala
660 665 670

Pro Asn Gly Tyr Thr Leu Glu Asp Leu Phe Ala Thr Ile Lys Tyr Tyr
675 680 685

Val Glu His Pro Asp Glu Arg Pro His Ser Asn Asp Gly Trp Gly Asn
690 695 700

Ala Ser Glu His Val Leu Gly Lys Lys Asp His Ser Glu Asp Pro Asn
705 710 715 720

Lys Asn Phe Lys Ala Asp Glu Glu Pro Val Glu Glu Thr Pro Ala Glu
725 730 735

Pro Glu Val Pro Gln Val Glu Thr Glu Lys Val Glu Ala Gln Leu Lys
740 745 750

Glu Ala Glu Val Leu Leu Ala Lys Val Thr Asp Ser Ser Leu Lys Ala
755 760 765

Asn Ala Thr Glu Thr Leu Ala Gly Leu Arg Asn Asn Leu Thr Leu Gln
770 775 780

Ile Met Asp Asn Asn Ser Ile Met Ala Glu Ala Glu Lys Leu Leu Ala
785 790 795 800

Leu Leu Lys Gly Ser Asn Pro Ser Ser Val Ser Lys Glu Lys Ile Asn
805 810 815

Lys Leu Asn



<210> 9 
<211> 2451 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 9 

atgaaaatta ataagaaata ccttgttggt tctgcggcag ctttgatttt aagtgtttgt 60
tcttacgagt tgggactgta tcaagctaga acggttaagg aaaataatcg tgtttcctat 120
atagatggaa aacaagcgac gcaaaaaacg gagaatttga ctcctgatga ggttagcaag 180
cgtgaaggaa tcaatgctga gcaaatcgtc atcaagataa cagaccaagg ctatgtcact 240
tcacatggcg accactatca ttattacaat ggtaaggttc cttatgacgc tatcatcagt 300
gaagaattac tcatgaaaga tccaaactat aagctaaaag atgaggatat tgttaatgag 360
gtcaagggtg gatatgttat caaggtagat ggaaaatact atgtttacct taaggatgct 420
gcccacgcgg ataacgtccg tacaaaagag gaaatcaatc gacaaaaaca agagcatagt 480
caacatcgtg aaggtggaac tccaagaaac gatggtgctg ttgccttggc acgttcgcaa 540
ggacgctata ctacagatga tggttatatc tttaatgctt ctgatatcat agaggatact 600
ggtgatgctt atatcgttcc tcatggagat cattaccatt acattcctaa gaatgagtta 660
tcagctagcg agttggctgc tgcagaagcc ttcctatctg gtcgaggaaa tctgtcaaat 720
tcaagaacct atcgccgaca aaatagcgat aacacttcaa gaacaaactg ggtaccttct 780
gtaagcaatc caggaactac aaatactaac acaagcaaca acagcaacac taacagtcaa 840
gcaagtcaaa gtaatgacat tgatagtctc ttgaaacagc tctacaaact gcctttgagt 900
caacgacatg tagaatctga tggccttgtc tttgatccag cacaaatcac aagtcgaaca 960
gctagaggtg ttgcagtgcc acacggagat cattaccact tcatccctta ctctcaaatg 1020
tctgaattgg aagaacgaat cgctcgtatt attccccttc gttatcgttc aaaccattgg 1080
gtaccagatt caaggccaga acaaccaagt ccacaaccga ctccggaacc tagtccaggc 1140
ccgcaacctg caccaaatct taaaatagac tcaaattctt ctttggttag tcagctggta 1200
cgaaaagttg gggaaggata tgtattcgaa gaaaagggca tctctcgtta tgtctttgcg 1260
aaagatttac catctgaaac tgttaaaaat cttgaaagca agttatcaaa acaagagagt 1320
gtttcacaca ctttaactgc taaaaaagaa aatgttgctc ctcgtgacca agaattttat 1380
gataaagcat ataatctgtt aactgaggct cataaagcct tgtttgnaaa taagggtcgt 1440
aattctgatt tccaagcctt agacaaatta ttagaacgct tgaatgatga atcgactaat 1500
aaagaaaaat tggtagatga tttattggca ttcctagcac caattaccca tccagagcga 1560
cttggcaaac caaattctca aattgagtat actgaagacg aagttcgtat tgctcaatta 1620
gctgataagt atacaacgtc agatggttac atttttgatg aacatgatat aatcagtgat 1680
gaaggagatg catatgtaac gcctcatatg ggccatagtc actggattgg aaaagatagc 1740
ctttctgata aggaaaaagt tgcagctcaa gcctatacta aagaaaaagg tatcctacct 1800
ccatctccag acgcagatgt taaagcaaat ccaactggag atagtgcagc agctatttac 1860
aatcgtgtga aaggggaaaa acgaattcca ctcgttcgac ttccatatat ggttgagcat 1920
acagttgagg ttaaaaacgg taatttgatt attcctcata aggatcatta ccataatatt 1980
aaatttgctt ggtttgatga tcacacatac aaagctccaa atggctatac cttggaagat 2040
ttgtttgcga cgattaagta ctacgtagaa caccctgacg aacgtccaca ttctaatgat 2100
ggatggggca atgccagtga gcatgtgtta ggcaagaaag accacagtga agatccaaat 2160
aagaacttca aagcggatga agagccagta gaggaaacac ctgctgagcc agaagtccct 2220
caagtagaga ctgaaaaagt agaagcccaa ctcaaagaag cagaagtttt gcttgcgaaa 2280
gtaacggatt ctagtctgaa agccaatgca acagaaactc tagctggttt acgaaataat 2340
ttgactcttc aaattatgga taacaatagt atcatggcag aagcagaaaa attacttgcg 2400
ttgttaaaag gaagtaatcc ttcatctgta agtaaggaaa aaataaacta a 2451

<210> 10
<211> 819
<212> PRT

<213> Streptococcus pneumoniae
<400> 10

Met Lys Ile Asn Lys Lys Tyr Leu Ala Gly Ser Val Ala Val Leu Ala
1 5 10 15
Leu Ser Val Cys Ser Tyr Glu Leu Gly Arg Tyr Gln Ala Gly Gln Asp
20 25 30

Lys Lys Glu Ser Asn Arg Val Ala Tyr Ile Asp Gly Asp Gln Ala Gly
35 40 45

Gln Lys Ala Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly
50 55 60

Ile Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80

Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95

Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110

Leu Lys Asp Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile
115 120 125

Lys Val Asn Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140

Asp Asn Ile Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu Arg
145 150 155 160

Ser His Asn His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg
165 170 175

Ala Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser
180 185 190

Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp
195 200 205

His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala
210 215 220

Ala Ala Glu Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser
225 230 235 240

Ser Ser Ser Tyr Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn
245 250 255

His Asn Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn
260 265 270






Ile Ser Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg
275 280 285

His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser
290 295 300

Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe
305 310 315 320

Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile
325 330 335

Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro
340 345 350

Glu Glu Pro Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln
355 360 365

Pro Ala Pro Ser Asn Pro Ile Asp Gly Lys Leu Val Lys Glu Ala Val
370 375 380

Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg
385 390 395 400

Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp
405 410 415

Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Thr Lys
420 425 430

Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr
435 440 445

Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg
450 455 460

Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp
465 470 475 480

Val Ser Ser Asp Lys Val Lys Leu Val Glu Asp Ile Leu Ala Phe Leu
485 490 495

Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ala Gln Ile
500 505 510

Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr
515 520 525
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Thr Ala Glu Asp Gly Tyr lie Phe Asp Pro Arg Asp lie Thr Ser Asp 
530 535 540 



Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp He 
545 550 555 



560 



Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gin Ala Tyr 



565 



570 



575 



Ala Glu Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gin Asp Ser 
580 585 590 

Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys 
595 600 605 

Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr 
610 615 620 

Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His 
625 630 635 640 

Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala 
645 650 655 

Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr 
660 665 670 

Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn 
675 680 685 



Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala Asp Thr Asn 
690 695 700 

Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu Lys Pro Glu 
705 710 715 720 

Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys Pro Glu Ser 
725 730 735 

Pro Lys Pro Thr Glu Glu Pro Glu Glu Ser Pro Glu Glu Ser Glu Glu 
740 745 750 

Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg Glu Ala Glu 
755 760 765 



Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile Lys Ser Asn Ala Lys 
770 775 780 

Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe Gly Thr Gln Asp 
785 790 795 800 

Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu Leu Ala Leu Leu Lys 
805 810 815 

Glu Ser Lys 



<210> 11 

<211> 2531 

<212> DNA 

<213> Streptococcus pneumoniae 



<400> 11 

atgaaaatta ataaaaaata tctagcaggt tcagtggcag tccttgccct aagtgtttgt 60 
tcctatgagc ttggacgtta ccaagctggt caggataaga aagagtctaa tcgagttgct 120 
tatatagatg gtgatcaggc tggtcaaaag gcagaaaact tgacaccaga tgaagtcagt 180 
aagagggagg ggatcaacgc cgaacaaatt gttatcaaga ttacggatca aggttatgtg 240 
acctctcatg gagaccatta tcattactat aatggcaagg ttccttatga tgccatcatc 300 
agtgaagagc tcctcatgaa agatccgaat tatcagttga aggattcaga cattgtcaat 360 
gaaatcaagg gtggttatgt cattaaggta aacggtaaat actatgttta ccttaaggat 420 
gcrgctcatg cggataatat tcggacaaaa gaagagatta aacgtcagaa gcaggaacgc 480 
agtcataatc ataactcaag agcagataat gctgttgctg cagccagagc ccaaggacgt 540 
tatacaacgg atgatgggta tatcttcaat gcatctgata tcattgagga cacgggtgat 600 
gcttatatcg ttcctcacgg cgaccattac cattacattc ctaagaatga gttatcagct 660 
agcgagttag ctgctgcaga agcctattgg aatgggaagc agggatctcg tccttcttca 720 
agttctagtt ataatgcaaa tccagctcaa ccaagattgt cagagaacca caatctgact 780 
gtcactccaa cttatcatca aaatcaaggg gaaaacattt caagcctttt acgtgaattg 840 
tatgctaaac ccttatcaga acgccatgtg gaatctgatg gccttatttt cgacccagcg 900 
caaatcacaa gtcgaaccgc cagaggtgta gctgtccctc atggtaacca ttaccacttt 960 
atcccttatg aacaaatgtc tgaattggaa aaacgaattg ctcgtattat tccccttcgt 1020 
tatcgttcaa accattgggt accagattca agaccagaag aaccaagtcc acaaccgact 1080 
ccagaaccta gtccaagtcc gcaaccagct ccaagcaatc caattgatgg gaaattggtc 1140 
aaagaagctg ttcgaaaagt aggcgatggt tatgtctttg aggagaatgg agtttctcgt 1200 
tatatcccag ccaaggatct ttcagcagaa acagcagcag gcattgatag caaactggcc 1260 
aagcaggaaa gtttatctca taagctagga actaagaaaa ctgacctccc atctagtgat 1320 
cgagaatttt acaataaggc ttatgactta ctagcaagaa ttcaccaaga tttacttgat 1380 
aataaaggtc gacaagttga ttttgaggct ttggataacc tgttggaacg actcaaggat 1440 
gtctcaagtg ataaagtcaa gttagtggaa gatattcttg ccttcttagc tccgattcgt 1500 
catccagaac gtttaggaaa accaaatgcg caaattacct acactgatga tgagattcaa 1560 
gtagccaagt tggcaggcaa gtacacagca gaagacggtt atatctttga tcctcgtgat 1620 
ataaccagtg atgaggggga tgcctatgta actccacata tgacccatag ccactggatt 1680 
aaaaaagata gtttgtctga agctgagaga gcggcagccc aggcttatgc traagagaaa 1740 
ggtttgaccc ctccttcgac agaccatcag gattcaggaa atactgaggc aaaaggagca 1800 
gaagctatct acaaccgmgt gaaagcagct aagaaggtgc cacttgatcg tatgccttac 1860 
aatcttcaat atactgtaga agtcaaaaac ggtagtttaa tcatacctca ttatgaccat 1920 
taccataaca tcaaatttga gtggtttgac gaaggccttt atgaggcacc taaggggtat 1980 
actcttgagg atcttttggc gactgtcaag tactatgtcg aacatccaaa cgaacgtccg 2040 
cattcagata atggttttgg taacgctagc gaccatgttc aaagaaacaa aaatggtcaa 2100 
gctgatacca atcaaacgga aaaaccaagc gaggagaaac ctcagacaga aaaacctgag 2160 
gaagaaaccc ctcgagaaga gaaaccgcaa agcgagaaac cagagtctcc aaaaccaaca 2220 
gaggaaccag aagaatcacc agaggaatca gaagaacctc aggtcgagac tgaaaaggtt 2280 
gaagaaaaac tgagagaggc tgaagattta cttggaaaaa tccaggatcc aattatcaag 2340 
tccaatgcca aagagactct cacaggatta aaaaataatt tactatttgg cacccaggac 2400 
aacaatacta ttatggcaga agctgaaaaa ctattggctt tattaaagga gagtaagtaa 2460 
aggtagaagc ttaagggcga atttggcacc caggacaaca atactattat ggcagaagct 2520 
gaaaaactat t 2531 